Sequence Description                                            Score    E-value  N

7tm_1    PF00001  7 transmembrane receptor (rhodopsin family)   119.9    4.7e-37  1

Parsed for domains:

Sequence Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
7tm_1      1/1      57   321        1   259 []   119.9  4.7e-37

Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 119.9, E = 4.7e-37

*->GNlLVilvilrtkklrtptnif i INLAVADLLf lltlppwalyylvg 
GN+ ++++++r +++r +t +++1NL ADLLf + p++ ++ -+ 
Flhl4273, 57 GNVCAL VL V AR-RRRRGATACLVLNLFCADLLF I SA I PL VL A VR- WT 101 

gaadWpf Gsa 1 Ck 1 vta I d vvnnyaS i 1 1 Lta 1 S i DRY 1 A 1 vhP 1 ryrrr 
e W++G++ C+1+ +++++++ + il+L+a S++R + Iv 1+ +r 
F1M4373, 1 02 — EAWLLGPVACHLLF Y YMTLSGS VT I LTLA A VSLERMVC I V-HLQRG VR 1 48 

rtsprrAkvv i 1 1 vwvlal 1 lslPpl If swvktveegngtlnvnvevCl i 
+r +V+++1+W +++++1P +f+ v+ ++ ++ ++ +C++ 
Flhl4273, 149 GPGRRARA VLL AL I VGYS A V AALPLC VFFR V VPQRLPG— ADQE I S I CTL 196 

df pccstasvstw Irsyvl IstlwgFl 1P1 1 v i IvcYtrl Irtlr 

+P++++ ++s+ +++ ++ Fl+P lvi++ Y+ II + + ++++ 
Flhl4273, 197 IWPMPG--^r£ISWBVSFVT^ 240 

kaaktllvYvvvFvtCWlPyf ivllldtlc 

+ + ++++++ ++++ ++ +tl++++v F++ W P I++11 + 
Flhl4273, 241 lcvsfay^ 290 

. lsi insstCelervipt<3illvtlwLayvNsclNPilY<-x 
-+ + +.fj.++++ + ++++Nq+1NPi+Y 

Flhl4273, 291 nFK QDL V I WPSLFF VVV APTF ANS ALNP I L Y 321 



FIG. 2. 
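
The per-domain header line in the search output above packs the model name, domain index, sequence span, bit score, and E-value into a single line. A minimal Python sketch for pulling those fields out follows. The function name parse_domain_line and the regular expression are illustrative assumptions, not part of the search program, and the sample string is the FIG. 2 header line with its OCR damage repaired.

```python
import re

# Hypothetical pattern for a per-domain header line of the form shown in FIG. 2.
DOMAIN_LINE = re.compile(
    r"(?P<model>\S+): domain (?P<index>\d+) of (?P<total>\d+), "
    r"from (?P<start>\d+) to (?P<end>\d+): "
    r"score (?P<score>-?\d+(?:\.\d+)?), E = (?P<evalue>\S+)"
)


def parse_domain_line(line):
    """Return the fields of one per-domain header line as a dict."""
    match = DOMAIN_LINE.search(line)
    if match is None:
        raise ValueError("not a per-domain header line: %r" % line)
    return {
        "model": match.group("model"),
        "index": int(match.group("index")),
        "total": int(match.group("total")),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
        "score": float(match.group("score")),
        "evalue": float(match.group("evalue")),
    }


# Sample input: the FIG. 2 header line, transcribed with OCR errors corrected.
hit = parse_domain_line(
    "7tm_1: domain 1 of 1, from 57 to 321: score 119.9, E = 4.7e-37"
)
# Number of query residues covered by the domain (both ends inclusive).
span_length = hit["end"] - hit["start"] + 1
```

The span length is computed with both endpoints included, which matches how the figure reports residue ranges.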







Analysis of Flh14273 (362 aa)

[Plot not reproduced: predicted features along residues 1 to 361, axis ticks every 40 residues; legible transmembrane segment scores: 4.9, 6.5, 3.2.]



>Flh14273, 1086 bases, 1825 checksum.

MSPECARAAQDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLEFAVSLLGNVC
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
G

FIG. 4.
Transmembrane Segments Predicted by MEMSAT



Start   End   Orient     Score
   46    66   out->ins     5.1
   75    98   ins->out     4.0
  113   134   out->ins     4.0
  156   177   ins->out     4.9
  209   227   out->ins     3.7
  266   289   ins->out     6.5
  297   321   out->ins     3.2


>Flh14273,
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
G 

Transmembrane Segments for Presumed Mature Peptide

Start   End   Orient     Score
   14    37   ins->out   [illegible]
   52    73   out->ins     4.0
   95   116   ins->out     4.9
  148   166   out->ins     3.7
  205   228   ins->out     6.5
  236   260   out->ins     3.2



>Flh14273, mature

LVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTLS
GSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVVP
QRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKRL
TVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWPS
LFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIISG

FIG. 6. 
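
The two MEMSAT tables of FIG. 6 appear to describe the same segments in two coordinate systems: full-length numbering and numbering on the presumed mature peptide. The sketch below checks that reading by testing whether a single shift maps the last six full-length segments onto the six mature-peptide segments. The coordinates are those legible in the figure; the helper name infer_offset and the assumption that only the first segment is absent from the mature table are illustrative, not stated in the figure.

```python
# (start, end) pairs read from the full-length MEMSAT table in FIG. 6.
FULL_LENGTH = [(46, 66), (75, 98), (113, 134), (156, 177),
               (209, 227), (266, 289), (297, 321)]

# (start, end) pairs read from the presumed mature peptide table in FIG. 6.
MATURE = [(14, 37), (52, 73), (95, 116), (148, 166),
          (205, 228), (236, 260)]


def infer_offset(full, mature):
    """Return the one shift mapping the trailing full-length segments onto
    the mature segments, or None if the coordinates disagree.

    Assumption: the mature table lacks only leading segments, so the last
    len(mature) full-length rows are the ones to compare.
    """
    tail = full[len(full) - len(mature):]
    offsets = set()
    for (full_start, full_end), (mat_start, mat_end) in zip(tail, mature):
        offsets.add(full_start - mat_start)
        offsets.add(full_end - mat_end)
    return offsets.pop() if len(offsets) == 1 else None


offset = infer_offset(FULL_LENGTH, MATURE)
# First full-length residue retained in the mature numbering.
first_mature_residue = None if offset is None else offset + 1
```

With the figure's values the shift is the same for every boundary, which is consistent with the mature listing beginning partway through the second line of the full-length listing.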
Query: 14273m
Scores for sequence family classification (score includes all domains):
Sequence Description                                            Score    E-value  N

7tm_1    PF00001  7 transmembrane receptor (rhodopsin family)   118.8    1e-36    1

Parsed for domains:

Sequence Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
7tm_1      1/1      57   321        1   259 []   118.8  1e-36
Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 118.8, E = 1e-36

x->6NlLVilvllrtkklrtptnifllNLAvADLLflltlppwalyylvg 
GN+ ++++++r +++r ++ +++1NL ADLLf+ + p++ ++ ++ 
14273m, 57 GNVC AL VL V AR-RRRRGASASLVLNLFCADLLFTS A I PLVLV VR-WT 101 

gaadVpf Gaa 1 Ck 1 vta 1 d vvnmy aS i 1 1 1 Lt a 1 S i DR Y 1 A 1 vhP 1 ryrrr 
e V++G+++C+1+ +++++++ + ll+L+a S++R + Iv lr + 
1 4273m, 1 02 — E AWLLGPV VCHLLP Y VMTMSGS VT I LTL AAVSLERMVC I V-RLRRGLS 1 48 

rtsprrA. kvv i 1 1 vWvlal 1 lslPpl If swvktveagngtlnvnvtvCl 
rr+++++++++W ++1++1P ++++ v + ++q ++ +C+ 
14273m, 149 GP-GRRTq AALL AF I WG YS AL A ALPL Y I LFR V VPQRLPGGD— QE I TPCT 195 

idfpeestasvstwlrsyvllstlvgF21PllvilvoYtrllrtlr. . . . 
+d+p++ + ++s+ +++ ++ Fl+P lvi++ Y+ 11 + + +++ 
14273m, 196 LDWPNRIG E I SWD VFFETLNFL VPGL V I V I S YSK I LQ I TKasr k 239 

'kaakt 1 1 vvvv vFv 1CW 1 Pyf i v 1 1 ldt- 
+ ^»f++'+ > *' 4W+*'++ +tl++++v F++ W P I ++11 + 

14273m, 240 r 1 1 1 s 1 ayseshq I r vsqqdyRLFRTLFLLM VSFF IMWSP 1 1 1 T I LL I L I 289 

c. la I ihestCelervl^ 
++ + + pi J: ++^+ + ++++N's+lNPl+Y 

14273m, 290 QnFR QDLVI WPSLFFWV VAFTFANS ALNP I LY 321 



FIG. 8.
Analysis of 14273m (362 aa)

[Plot not reproduced: feature tracks Cys, Ngly, out / TM / ins topology, and PF00001 domain along residues 1 to 361, axis ticks every 40 residues; transmembrane segment scores as labeled: 5.1, 3.3, 3.8, 4.5, 2.4, 6.5, 3.2.]



>14273m, 1086 bases, 6943 checksum

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS
S

FIG. 10.
Prosite Pattern Matches for 14273m

>PS00001/PDOC00001/ASN_GLYCOSYLATION N-glycosylation site.

Query:   21 NRTH   24
Query:  322 NMSL  325

>PS00002/PDOC00002/GLYCOSAMINOGLYCAN Glycosaminoglycan attachment site.
RU Additional rules:

RU There must be at least two acidic amino acids (Glu or Asp) from -2 to
-4 relative to the serine.

Query:  148 SGPG  151

>PS00004/PDOC00004/CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase phosphorylation site.
Query:  239 KRLT  242

>PS00005/PDOC00005/PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.

Query:  237 SRK  239
Query:  350 SVR  352

>PS00006/PDOC00006/CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.

Query:   40 SVVE   43
Query:  256 SQQD  259

>PS00008/PDOC00008/MYRISTYL N-myristoylation site.

Query:   57 GNVCAL   62
Query:   72 GASASL   77
Query:  343 GAIFTD  348

>PS00009/PDOC00009/AMIDATION Amidation site.
Query:  150 PGRR  153



FIG. 11. 
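
The matches listed in FIG. 11 can be reproduced approximately with ordinary regular expressions. The sketch below is not the program that produced the figure. The regular expressions are translations of the commonly published PROSITE patterns for these entries and should be treated as assumptions, as should the sequence string, which is the 14273m listing transcribed with OCR spacing removed. The helper name scan is hypothetical. The additional acidic-residue rule printed under the glycosaminoglycan entry is not applied here.

```python
import re

# 14273m protein sequence, transcribed from the figure listing (assumption:
# OCR spacing and obvious misreads removed). 361 residues.
SEQ = (
    "MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC"
    "ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM"
    "SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV"
    "PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR"
    "LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP"
    "SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS"
    "S"
)

# PROSITE-style patterns rewritten as Python regular expressions.
PATTERNS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",          # N-{P}-[ST]-{P}
    "GLYCOSAMINOGLYCAN": r"SG.G",                   # S-G-x-G
    "CAMP_PHOSPHO_SITE": r"[RK]{2}.[ST]",           # [RK](2)-x-[ST]
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",               # [ST]-x-[RK]
    "CK2_PHOSPHO_SITE": r"[ST].{2}[DE]",            # [ST]-x(2)-[DE]
    "MYRISTYL": r"G[^EDRKHPFYW].{2}[STAGCN][^P]",   # G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}
    "AMIDATION": r".G[RK][RK]",                     # x-G-[RK]-[RK]
}


def scan(seq, regex):
    """Return (1-based start, matched text) for every match of regex in seq.

    The pattern is wrapped in a lookahead so overlapping matches are kept.
    """
    finder = re.compile("(?=(" + regex + "))")
    return [(m.start() + 1, m.group(1)) for m in finder.finditer(seq)]


hits = {name: scan(SEQ, regex) for name, regex in PATTERNS.items()}

for name, found in hits.items():
    for start, text in found:
        print(name, start, text, start + len(text) - 1)
```

Wrapping each pattern in a lookahead is a deliberate choice: short motifs such as the kinase sites can overlap, and a plain left-to-right search would silently skip the second of two overlapping sites.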



Transmembrane Segments Predicted by MEMSAT

Start         End   Orient     Score
[illegible]    98   ins->out   [illegible]
  113         134   out->ins   [illegible]
  156         177   ins->out     4.5
  209         227   out->ins   [illegible]
  266         289   ins->out     6.5
  297         321   out->ins     3.2



>14273m,

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS
S

Transmembrane Segments for Presumed Mature Peptide

Start   End   Orient     Score
   16    37   ins->out     3.3
   52    73   out->ins   [illegible]
   95   116   ins->out     4.5
  148   166   out->ins     2.4
  205   228   ins->out     6.5
  236   260   out->ins     3.2



>14273m, mature

LVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTMS
GSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVVP
QRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKRL
TLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWPS
LFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVISS



FIG. 12. 



